Abstract. — We derive a lower bound for the subword complexity of the
base-$b$ expansion ($b \geq 2$) of all real numbers whose irrationality exponent is
equal to 2. This provides a generalization of a theorem due to Ferenczi and
Mauduit. As a consequence, we obtain the first lower bound for the subword
complexity of the number $e$ and of some other transcendental exponential
periods.



1. Introduction 

The decimal expansion of real numbers like $\sqrt{2}$, $\pi$ or $e$ appears to be quite
mysterious and, for a long time, has baffled mathematicians. While numerical 
observations seem to speak in favour of a complex structure, most questions 
one may imagine to ask about the decimal expansion of classical irrational 
constants turn out to be out of reach. 

Kontsevich and Zagier [18] offered a promising framework to try to distinguish
usual constants from other real numbers by introducing the notions of
period and of exponential period. Algebraic numbers, $\pi$, $\log 2$ and $\zeta(3)$ are
periods, while $e$ is conjecturally not a period. However, $e$ is a typical example of
an exponential period. Exponential periods form a countable set that contains
the set of periods. We refer the reader to [18] for exact definitions and more
results about both notions, but to paraphrase these authors, all classical
constants are periods in an appropriate sense. Folklore suggests that all irrational
periods are normal numbers. Recall that a real number is a normal number if
for every integer $b \geq 2$ and every positive integer $n$, each one of the $b^n$ blocks
of length $n$ over the alphabet $\{0, 1, \ldots, b-1\}$ occurs in its base-$b$ expansion
with frequency $1/b^n$. This notion was introduced in 1909 by Borel, who
proved that almost all numbers, with respect to the Lebesgue measure, are
normal, despite the uncomfortable fact that not a single natural example of a
normal number is known.

An interesting (and perhaps more reasonable) way to tackle problems concerned
with the expansions of classical constants in integer bases is to consider
the subword complexity of real numbers. Let $\xi$ be a real number and $b \geq 2$ be
a positive integer. Then $\xi$ has a unique expansion in the base $b$, that is, there
exists a unique sequence $\mathbf{a} = (a_n)_{n \geq -k}$ with values in $\{0, 1, \ldots, b-1\}$ such
that

$\xi = \sum_{n \geq -k} \frac{a_n}{b^n} := a_{-k} a_{-k+1} \cdots a_{-1} a_0 \bullet a_1 a_2 \cdots$

The complexity function of $\xi$ with respect to the base $b$ is the function that
associates with each positive integer $n$ the positive integer

$p(\xi, b, n) := \mathrm{Card}\{(a_j, a_{j+1}, \ldots, a_{j+n-1}),\ j \geq 1\}.$

A normal number thus has the maximum possible complexity in every integer
base, that is, $p(\xi, b, n) = b^n$ for every positive integer $n$ and every integer $b \geq 2$.
As mentioned before, one usually expects such a high complexity for numbers
like $\sqrt{2}$, $\pi$ and $e$. This problem was first addressed in 1938 by Hedlund and
Morse [19].
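
As a rough illustration of the definition of the complexity function, the following Python sketch counts the distinct blocks of length n in a finite prefix of an expansion. The function name `block_count` and the two sample digit strings are assumptions made for this example only. A finite prefix can only give a lower bound for the complexity of the full expansion, so this is an experiment and not a proof of anything.

```python
def block_count(digits: str, n: int) -> int:
    """Number of distinct blocks of length n occurring in a finite digit string."""
    if n <= 0 or n > len(digits):
        return 0
    return len({digits[j:j + n] for j in range(len(digits) - n + 1)})


# Base-10 digits of 1/7 after the point (the period 142857, repeated 5 times).
rational_prefix = "142857" * 5
# First 30 decimal digits of e after the point.
e_prefix = "718281828459045235360287471352"

# For the rational number the count stays bounded (here it is 6 for each n).
print([block_count(rational_prefix, n) for n in range(1, 8)])
# For e the count observed on this short prefix keeps growing with n.
print([block_count(e_prefix, n) for n in range(1, 4)])
```

The bounded count for 1/7 is the behaviour that Theorem HM attributes to rational numbers; the values obtained for e only reflect the 30 digits that were supplied.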

In their paper, Hedlund and Morse obtained a fundamental result that can 
be restated as follows. 

Theorem HM. — Let $b \geq 2$ be an integer and $\xi$ be a real number. Then $\xi$ is
rational if and only if it has a bounded complexity function. Furthermore, if $\xi$
is irrational, its complexity function is increasing and thus

$p(\xi, b, n) - n \geq 1, \quad \forall n \geq 1.$

To find lower bounds for the complexity function of algebraic irrational 
numbers is a challenging problem. In 1997, Ferenczi and Mauduit [14] proved
the theorem below. Actually, their result is slightly weaker and the present 
statement is given according to a clever remark of Allouche outlined in [6]. 

Theorem FM. — Let $b \geq 2$ be an integer and $\xi$ be an algebraic irrational
number. Then,

(1)   $\lim_{n \to \infty} p(\xi, b, n) - n = +\infty.$

The proof of Theorem FM mixes techniques from combinatorics on words
and Diophantine approximation. The main ingredient is a p-adic version of
Roth's theorem due to Ridout [20]. Recently, Bugeaud and the author [1] (see
also [4]) improved Theorem FM by means of the Schmidt Subspace Theorem.
Under the same assumption, these authors proved that (1) can be replaced by

$\lim_{n \to \infty} \frac{p(\xi, b, n)}{n} = +\infty.$

The situation regarding transcendental constants is even worse: apparently,
there is not a single transcendental exponential period for which one knows
an improvement of the lower bound given by Theorem HM. Of course, one
could choose an exponential period $\xi$ and compute the first digits of its base-$b$
expansion. If one is lucky enough, one will find occurrences of many different
blocks of digits of a given length. But this would only lead, for some integer
$k$, to a lower bound of the type

$p(\xi, b, n) - n \geq k,$

for all sufficiently large positive integers $n$.

Recall that the irrationality exponent of an irrational number $\xi$, denoted by
$\mu(\xi)$, is defined as the supremum of the real numbers $\rho$ for which the inequality

$\left| \xi - \frac{p}{q} \right| < \frac{1}{q^{\rho}}$

has infinitely many different rational solutions $p/q$. It always satisfies

$2 \leq \mu(\xi) \leq +\infty.$

The set of real numbers whose irrationality exponent is equal to 2 has full 
Lebesgue measure. By Roth's theorem [21], algebraic irrational numbers all
have an irrationality exponent equal to 2. The aim of this note is to generalize 
Theorem FM as follows. 



Theorem 1. — Let $b \geq 2$ be an integer and $\xi$ be an irrational real number
such that $\mu(\xi) = 2$. Then,

(2)   $\lim_{n \to \infty} p(\xi, b, n) - n = +\infty.$

Our proof of Theorem 1 is essentially a combination of known results that
rely on fine combinatorial properties of infinite words with a very low complexity.
In particular, we use, in an essential way, a result due to Berthé, Holton
and Zamboni [10] concerning initial repetitions occurring in Sturmian words.

We derive the following consequences of Theorem 1.

Corollary 2. — For every integer $b \geq 2$, the number $e$ satisfies (2).

To our knowledge, this is the first example of a transcendental exponential
period for which we can improve the bound of Theorem HM.

The only property of the number $e$ used in the proof of Corollary 2 is
that $\mu(e) = 2$, which follows from Euler's formula for the continued fraction
expansion of $e$ (see the proof of Corollary 2 in Section 3). Actually, many
other examples of numbers involving the exponential function, trigonometric
functions, or the modified Bessel function at rational arguments also have an
irrationality exponent equal to 2. In particular, the same conclusion holds in
Corollary 2 if we replace the number $e$ by any of the following numbers (see
US S3]): 

$e^a$, $a \in \mathbb{Q}$, $a \neq 0$;

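$\tan\left(\frac{1}{a}\right)$, $\sqrt{a}\,\tan\left(\frac{1}{\sqrt{a}}\right)$, $\frac{1}{\sqrt{a}}\tan\left(\frac{1}{\sqrt{a}}\right)$, $a \in \mathbb{N}$, $a \neq 0$;

$\tanh\left(\frac{2}{a}\right)$, $a \in \mathbb{N}$, $a \neq 0$;

$\sqrt{\frac{v}{u}}\,\tanh\left(\frac{1}{\sqrt{uv}}\right)$, $u, v \in \mathbb{N}$, $uv \neq 0$.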



Other interesting values covered by our approach are the numbers

$\frac{J_{(p/q)+1}(2/q)}{J_{p/q}(2/q)}$ and $\frac{I_{(p/q)+1}(2/q)}{I_{p/q}(2/q)}$, $p/q \in \mathbb{Q}$,

where $J_\lambda(z) = \left(\frac{z}{2}\right)^{\lambda} \sum_{n=0}^{+\infty} \frac{(iz/2)^{2n}}{n!\,\Gamma(\lambda+n+1)}$ denotes the Bessel function of the
first kind and $I_\lambda(z) = \sum_{n=0}^{+\infty} \frac{(z/2)^{\lambda+2n}}{n!\,\Gamma(\lambda+n+1)}$ denotes the modified (or hyperbolic)
Bessel function of the first kind (see for instance [17]).

Furthermore, multiplying any of these numbers by a nonzero rational and
then adding a rational leads to a new example for which our result can be
applied. We can also take the natural action of $\mathrm{GL}_2(\mathbb{Z})$ on any of these numbers.
That is, starting from one of the above numbers $\xi$, the number $(a\xi + b)/(c\xi + d)$,
where $|ad - bc| = 1$, also satisfies the bound (2) of Theorem 1.

Another consequence of Theorem 1 is that a real number with a bounded
sequence of partial quotients in its continued fraction expansion cannot have
base-$b$ expansions that are too simple.

Corollary 3. — Let $b \geq 2$ be an integer and $\xi$ be an irrational real number
whose continued fraction expansion is $[a_0, a_1, a_2, \ldots]$. If the sequence of
integers $(a_n)_{n \geq 0}$ is bounded, then $\xi$ satisfies (2).

2. Repetitions in infinite words with a low complexity 

We first introduce some notation from combinatorics on words. 

Let $A$ be a finite set. The length of a word $W$ on the alphabet $A$, that is,
the number of letters composing $W$, is denoted by $|W|$. If $a$ is a letter and $W$
a finite word, then $|W|_a$ denotes the number of occurrences of the letter $a$ in
$W$. For any positive integer $k$, we write $W^k$ for the word

$\underbrace{W \cdots W}_{k \text{ times}}$

(the concatenation of the word $W$ repeated $k$ times). More generally, for any
positive real number $x$, $W^x$ denotes the word $W^{\lfloor x \rfloor} W'$, where $W'$ is the prefix
of $W$ of length $\lceil (x - \lfloor x \rfloor)|W| \rceil$. Here, $\lfloor x \rfloor$ and $\lceil x \rceil$ denote the floor and ceiling
functions, respectively.
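
The real powers of a word defined above can be made concrete with a short Python sketch. The function name `word_power` is an assumption of this example, and the exponent is passed as an exact `Fraction` so that the floor and ceiling are not affected by floating-point rounding.

```python
import math
from fractions import Fraction


def word_power(w: str, x: Fraction) -> str:
    """Return W^x = W^floor(x) W', where W' is the prefix of W
    of length ceil((x - floor(x)) * |W|)."""
    if x <= 0:
        raise ValueError("x must be positive")
    whole = math.floor(x)                       # number of full copies of W
    tail_len = math.ceil((x - whole) * len(w))  # length of the prefix W'
    return w * whole + w[:tail_len]


print(word_power("abc", Fraction(7, 3)))  # two full copies, then a prefix of length 1
print(word_power("ab", Fraction(5, 2)))   # two full copies, then a prefix of length 1
```

With an integer exponent the prefix is empty and the function reduces to ordinary repetition.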

We now consider two exponents that measure repetitions occurring in infinite
words. They were introduced in [10] and [2] (see also [5]), respectively.
The first exponent, the initial critical exponent of an infinite word $\mathbf{a}$, is defined
as the supremum of all positive real numbers $\rho$ for which there exist arbitrarily
long prefixes $V$ such that $V^{\rho}$ is also a prefix of $\mathbf{a}$. The second exponent, the
Diophantine exponent of an infinite word $\mathbf{a}$, is defined as the supremum of the
real numbers $\rho$ for which there exist arbitrarily long prefixes of $\mathbf{a}$ that can be
factorized as $UV^w$, where $U$ and $V$ are two finite words ($U$ possibly empty)
and $w$ is a real number such that

$\frac{|UV^w|}{|UV|} \geq \rho.$

The initial critical exponent and the Diophantine exponent of $\mathbf{a}$ are respectively
denoted by $\mathrm{ice}(\mathbf{a})$ and $\mathrm{dio}(\mathbf{a})$. Both exponents are clearly related by the
following relation

(3)   $1 \leq \mathrm{ice}(\mathbf{a}) \leq \mathrm{dio}(\mathbf{a}) \leq +\infty.$
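
To get a feeling for the ratio that enters the definition of the Diophantine exponent, one can compute it by brute force on a single finite prefix. The sketch below is only an illustration under stated assumptions: the name `best_repetition_ratio` is hypothetical, the search is cubic in the length of the prefix, and the exponent itself is a supremum over arbitrarily long prefixes, which no finite computation determines.

```python
from fractions import Fraction


def best_repetition_ratio(prefix: str) -> Fraction:
    """Largest value of |U V^w| / |U V| over all factorizations
    prefix = U V^w with V nonempty and w >= 1 (U possibly empty)."""
    n = len(prefix)
    best = Fraction(1)
    for u in range(n):                 # u = |U|
        for v in range(1, n - u + 1):  # v = |V|
            # prefix = U V^w exactly when the letters after U have period v
            if all(prefix[i] == prefix[i - v] for i in range(u + v, n)):
                best = max(best, Fraction(n, u + v))
                break  # a longer V can only lower the ratio for this U
    return best


print(best_repetition_ratio("abababab"))   # U empty, V = "ab"
print(best_repetition_ratio("xabcabcab"))  # U = "x", V = "abc"
```

Taking U empty in the same search gives the analogous finite quantity for the initial critical exponent.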

Recall that the subword complexity function of an infinite word $\mathbf{a} = a_1 a_2 \cdots$
is the function that associates with each positive integer $n$ the positive integer

$p(\mathbf{a}, n) := \mathrm{Card}\{(a_j, a_{j+1}, \ldots, a_{j+n-1}),\ j \geq 1\}.$

We now prove the following result.

Proposition 4. — Let $\mathbf{a}$ be an infinite word such that the difference $p(\mathbf{a}, n) - n$
is bounded. Then

$\mathrm{dio}(\mathbf{a}) > 2.$

Proof. — Our first step is a reduction argument that was previously outlined
by Allouche in a similar context [6]. More precisely, we are going to prove
that it is sufficient for our purpose to focus on the Diophantine exponent of
Sturmian words. Sturmian words are defined as the binary words $\mathbf{s}$ for which
$p(\mathbf{s}, n) = n + 1$ for every positive integer $n$. Recall that the set of Sturmian
words is uncountable.
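
The defining property of Sturmian words can be checked numerically on the Fibonacci word, the standard example of a Sturmian word. The sketch below is an illustration only: the function names and the prefix length 2000 are choices made for this example, and counting factors of a finite prefix only confirms the identity for the small values of n that are tested.

```python
def fibonacci_word(length: int) -> str:
    """Prefix of the Fibonacci word, the fixed point of the morphism 0 -> 01, 1 -> 0."""
    word = "0"
    while len(word) < length:
        # Each iterate of the morphism is a prefix of the next one.
        word = "".join("01" if c == "0" else "0" for c in word)
    return word[:length]


def factor_count(word: str, n: int) -> int:
    """Number of distinct factors of length n in a finite word."""
    return len({word[j:j + n] for j in range(len(word) - n + 1)})


prefix = fibonacci_word(2000)
# Compare the number of factors of length n with n + 1 for n = 1, ..., 20.
print(all(factor_count(prefix, n) == n + 1 for n in range(1, 21)))
```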

From now on, we fix an infinite word $\mathbf{a} = a_1 a_2 \cdots$ defined over a finite
alphabet $A$ and such that the difference $p(\mathbf{a}, n) - n$ is bounded. If $\mathbf{a}$ is eventually
periodic, it is easily checked that $\mathrm{dio}(\mathbf{a})$ is infinite. We can thus assume that $\mathbf{a}$
is aperiodic. Theorem HM thus implies that the subword complexity function
of $\mathbf{a}$ is increasing. Consequently, the difference $p(\mathbf{a}, n) - n$ is a bounded
nondecreasing sequence of integers. Such a sequence is eventually constant, and there
thus exist two positive integers $k$ and $n_0$ such that

(4)   $p(\mathbf{a}, n) = n + k, \quad \forall n \geq n_0.$

By a result of Cassaigne [13], an infinite word $\mathbf{a}$ satisfies Equality (4) if and
only if there exist a finite word $W$, a Sturmian infinite word $\mathbf{s}$ defined over
$\{0, 1\}$ and a nonerasing morphism $\varphi$ from the free monoid $\{0, 1\}^*$ into $A^*$
such that

(5)   $\mathbf{a} = W\varphi(\mathbf{s}).$

Our infinite word $\mathbf{a}$ thus has such a decomposition and we claim that

(6)   $\mathrm{dio}(\mathbf{a}) \geq \mathrm{dio}(\mathbf{s}).$

Set $\mathbf{s} = s_1 s_2 \cdots$. To prove that (6) holds, we only need a classical property of
Sturmian words: each of the two letters occurring in a Sturmian word has a
frequency. This means that there exists a real number $\alpha$ in $(0, 1)$ such that

$\lim_{n \to \infty} \frac{|s_1 s_2 \cdots s_n|_1}{n} = \alpha,$

and consequently,

$\lim_{n \to \infty} \frac{|s_1 s_2 \cdots s_n|_0}{n} = 1 - \alpha.$

The number $\alpha$ is always irrational and is termed the slope of $\mathbf{s}$. It follows that

(7)   $|\varphi(s_1 s_2 \cdots s_n)| = \delta n + o(n),$

where $\delta := \alpha|\varphi(1)| + (1 - \alpha)|\varphi(0)|$. Here and in the sequel, $o$ stands for the
usual Landau notation.

Now, let us assume that $\mathrm{dio}(\mathbf{s}) = \rho$ for some positive real number $\rho$, and let
$\varepsilon$ be a positive real number. By definition of the Diophantine exponent, there
exists an infinite sequence of prefixes of $\mathbf{s}$ that can be factorized as $U_n V_n^{w_n}$,
with $|U_n V_n^{w_n}|/|U_n V_n| \geq \rho - \varepsilon$ and such that the sequence $(|U_n V_n|)_{n \geq 1}$ increases.
We then infer from (5) that $\mathbf{a}$ begins with the word

$W\varphi(U_n V_n^{w_n}).$

Set $A_n = W\varphi(U_n)$ and $B_n = \varphi(V_n)$. There thus exists a positive real number
$r_n$ such that

$W\varphi(U_n V_n^{w_n}) = A_n B_n^{r_n}.$

Since $|U_n V_n|$ can be chosen arbitrarily large, we infer from Equality (7) that

$|\varphi(U_n V_n^{w_n})| = \delta |U_n V_n^{w_n}| + o(|U_n V_n^{w_n}|)$

and

$|\varphi(U_n V_n)| = \delta |U_n V_n| + o(|U_n V_n|).$

Consequently,

$|A_n B_n^{r_n}|/|A_n B_n| \geq \rho - 2\varepsilon,$

for every $n$ large enough. This shows that

(8)   $\mathrm{dio}(\mathbf{a}) \geq \rho = \mathrm{dio}(\mathbf{s}),$

as desired.

We now have to distinguish two cases depending on the Diophantine properties
of the slope $\alpha$ of the Sturmian word $\mathbf{s}$. Let us denote by $[0, m_1, m_2, \ldots]$
the continued fraction expansion of $\alpha$.

First, let us assume that $\alpha$ has a bounded sequence of partial quotients,
say bounded by a positive integer $M$. Then there are only a finite number of
distinct pairs $(m_j, m_{j+1})$ and a fortiori of distinct triples $(m_j, m_{j+1}, m_{j+2})$. By
the pigeonhole principle, there exist either a pair of integers $(s, t)$, $2 \leq s \leq M$,
$1 \leq t \leq M$, such that $(m_j, m_{j+1}) = (s, t)$ for infinitely many indices $j$, or there
exist infinitely many indices $j$ such that $(m_j, m_{j+1}, m_{j+2}) = (1, 1, 1)$. In all cases,
mixing Propositions 5.1 and 5.2 of [10] we get that

$\mathrm{ice}(\mathbf{s}) \geq 2 + \frac{1}{2(M+1)^2 + 1} > 2.$
We thus infer from Inequality (3) that

(9)   $\mathrm{dio}(\mathbf{s}) > 2.$

On the other hand, if $\alpha$ has an unbounded sequence of partial quotients, it
is shown in Proposition 11.1 of [3] that

(10)   $\mathrm{dio}(\mathbf{s}) = +\infty.$

To sum up, (8), (9) and (10) give

$\mathrm{dio}(\mathbf{a}) > 2,$

concluding the proof. □

3. Diophantine exponent and rational approximations

We now briefly recall some interplay between the Diophantine exponent and
the irrationality exponent that can be found in [3]. Let $\xi$ be a real number
whose base-$b$ expansion is $0.a_1 a_2 \cdots$. Set $\mathbf{a} = a_1 a_2 \cdots$. Then the Diophantine
exponent of $\mathbf{a}$ and the irrationality exponent of $\xi$ are linked by the following
inequality:

(11)   $\mathrm{dio}(\mathbf{a}) \leq \mu(\xi).$

Indeed, let us assume that the word $\mathbf{a}$ begins with a prefix of the form $UV^w$.
Set $q = b^{|U|}(b^{|V|} - 1)$. A simple computation shows that there exists an integer
$p$ such that

$p/q = 0.UVVV\cdots.$

Since $\xi$ and $p/q$ have the same first $|UV^w|$ digits, we obtain that

$\left| \xi - \frac{p}{q} \right| < \frac{1}{b^{|UV^w|}}$

and thus

(12)   $\left| \xi - \frac{p}{q} \right| < \frac{1}{q^{\rho}},$

where $\rho = |UV^w|/|UV|$. We do not claim here that $p/q$ is written in lowest
terms. Actually, it may well happen that the gcd of $p$ and $q$ is quite large,
but (12) still holds in that case. Inequality (11) then follows directly from the
definition of both exponents.
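
The "simple computation" that produces the integer p can be sketched in a few lines of Python. The function name `periodic_rational` is an assumption of this example, digits are given as strings, and Python's `int(text, base)` restricts the sketch to bases between 2 and 36. Note that `Fraction` reduces to lowest terms, whereas the argument above works with the unreduced denominator q.

```python
from fractions import Fraction


def periodic_rational(u: str, v: str, b: int = 10) -> Fraction:
    """Value of 0.UVVV... in base b, computed as p / q
    with q = b^|U| * (b^|V| - 1)."""
    if not v:
        raise ValueError("the period V must be nonempty")
    q = b ** len(u) * (b ** len(v) - 1)
    u_val = int(u, b) if u else 0  # integer whose base-b digits are U
    v_val = int(v, b)              # integer whose base-b digits are V
    # 0.UVVV... = U / b^|U| + V / (b^|U| * (b^|V| - 1))
    p = u_val * (b ** len(v) - 1) + v_val
    return Fraction(p, q)


print(periodic_rational("1", "6"))        # 0.1666... in base 10
print(periodic_rational("", "142857"))    # 0.142857142857... in base 10
print(periodic_rational("0", "1", b=2))   # 0.0111... in base 2
```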

We are now ready to conclude the proof of our main results. 

Proof of Theorem 1. — It is a straightforward consequence of Proposition 4
and Inequality (11). □



Proof of Corollary 2. — It is known after Euler (see footnote 1) that the continued fraction
expansion of $e$ has very special patterns; namely

(13)   $e = [2, 1, 2, 1, 1, 4, 1, 1, 6, 1, 1, 8, \ldots, 1, 1, 2n, 1, 1, \ldots].$

From Euler's formula, we can easily derive that the irrationality exponent of
$e$ satisfies $\mu(e) = 2$, concluding the proof. Indeed, if $q_n$ and $a_n$ respectively
denote the denominator of the $n$-th convergent and the $n$-th partial quotient of a real number $\xi$,
the theory of continued fractions ensures that

$\mu(\xi) = \limsup_{n \to \infty} \left( 2 + \frac{\log a_{n+1}}{\log q_n} \right).$

□
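
The formula for the irrationality exponent in terms of continued fractions can be explored numerically for e. The sketch below is a numerical illustration only, not a proof: the function names and the cutoff of 3001 partial quotients are assumptions of this example, and a finite computation cannot establish the value of a limit superior.

```python
import math


def e_partial_quotients(count: int) -> list[int]:
    """First `count` partial quotients of e = [2, 1, 2, 1, 1, 4, 1, 1, 6, ...]."""
    quotients = [2]
    k = 1
    while len(quotients) < count:
        quotients.extend([1, 2 * k, 1])
        k += 1
    return quotients[:count]


def convergent_denominators(quotients: list[int]) -> list[int]:
    """Denominators q_n of the convergents, via q_n = a_n * q_{n-1} + q_{n-2}."""
    q_prev, q_curr = 0, 1  # q_{-1} = 0 and q_0 = 1
    denominators = [q_curr]
    for a_n in quotients[1:]:
        q_prev, q_curr = q_curr, a_n * q_curr + q_prev
        denominators.append(q_curr)
    return denominators


a = e_partial_quotients(3001)
q = convergent_denominators(a)
# Terms 2 + log(a_{n+1}) / log(q_n); we start at n = 2 because q_0 = q_1 = 1.
terms = [2 + math.log(a[n + 1]) / math.log(q[n]) for n in range(2, len(a) - 1)]
print(max(terms[-300:]))
```

The partial quotients of e grow linearly while the denominators grow faster than exponentially, which is why the printed value is close to 2.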



1. Actually, this formula seems to have been discovered first by R. Cotes; Euler would be 
the first to give a proof. 

Proof of Corollary 3. — A basic result from the theory of continued fractions
ensures that $\mu(\xi) = 2$ when $\xi$ has a bounded sequence of partial quotients
in its continued fraction expansion (see for instance [16], Theorem 23, Page
36). □



4. Comments 

We end this note with a few comments.

Remark 5. — As already mentioned, the main ingredient in the proof of 
Theorem FM is a p-adic version of Roth's theorem due to Ridout. Incidentally, 
our proof of Theorem 1 provides a new proof of Theorem FM. In particular,
this shows that it can be obtained with the use of Roth's theorem only. That 
is, p-adic considerations are unnecessary to prove Theorem FM. 

However, the use of some p-adic information by Ferenczi and Mauduit in
[14] turned out to be of great importance since it led to the main lower bound
for the complexity of algebraic irrational numbers obtained in [1].

Remark 6. — It is likely that Proposition 4 could be improved to

(14)   $\mathrm{dio}(\mathbf{a}) \geq \frac{3 + \sqrt{5}}{2}.$

This result would be optimal since the bound is reached for the Fibonacci
word, which is the most famous example among Sturmian words. We note
that, as a consequence of irrationality measures obtained by Baker in [9],
Inequality (14) would permit us to show that the conclusion of Corollary 2
still holds if we replace the number $e$ by $\log(1 + 1/n)$, for every integer $n \geq 68$.
However, our approach would not permit us to deduce a new lower bound for
the complexity of expansions in integer bases of periods like $\pi$, $\log 2$ or $\zeta(3)$
from (14) since the best known upper bounds for the irrationality exponent of
these numbers are all larger than $(3 + \sqrt{5})/2$.

Remark 7. — As a limitation of our approach, we note that there are real
numbers with a low complexity in some integer base but with an irrationality
exponent equal to 2. For instance, it was proved in [22] that the binary number

$\xi := \sum_{n \geq 0} \frac{1}{2^{2^n}}$

has bounded partial quotients in its continued fraction expansion, while a
classical theorem of Cobham implies that $p(\xi, 2, n) = O(n)$ (see for instance
Corollary 10.3.2 of [8], Page 304). Actually, one can deduce from the proof of
Lemma 2.4 in [15] the more precise upper bound $p(\xi, 2, n) \leq (2 + \ln 3)n + 4$,
for every positive integer $n$.

Remark 8. — It would also be very interesting to investigate the complexity
of expansions of $e$ and other constants from a computational point of view. In
this direction, we pose the following open question. Note that similar open
questions were posed by Allouche and Shallit [8], Page 402, Problems 3 and
4.

Problem. — Prove that the decimal expansion of e cannot be produced by a 
finite automaton. 